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ABSTRACT 

f-*) ■ We present a quantitative analysis of the astrophysical and cosmological information 

that can be extracted from the many important wide-area, shallow surveys that will 
, be carried out in the next few years. Our calculations combine the predictions of the 

C<") ' physical model by Granato et al. (2004) for the formation and evolution of spheroidal 

\ galaxies with up-to-date phenomenological models for the evolution of starburst and 

C " ■ normal late-type galaxies and of radio sources. We compute the expected number 

counts and the redshift distributions of these source populations separately and then 
^ , focus on proto-spheroidal galaxies. For the latter objects we predict the counts and 

O redshift distributions of strongly lensed sources at 250, 350, 500, and 850 /im, the 

angular correlation function of sources detected in the surveys considered, the angu- 
lar power spectra due to clustering of sources below the detection limit in Herschel 
and Planck surveys. An optimal survey for selecting strongly lensed proto-spheroidal 
galaxies is described, and it is shown how they can be easily distinguished from the 
other source populations. We also discuss the detectability of the imprints of the 
1-halo and 2-halo regimes on angular correlation functions and clustering power spec- 
tra, as well as the constraints on cosmological parameters that can be obtained from 
the determinations of these quantities. The novel data relevant to derive the first 
sub-millimeter estimates of the local luminosity functions of starburst and late-type 
galaxies, and the constraints on the properties of rare source populations, such as 
blazars, are also briefly described. 

Key words: gravitational lensing - galaxies: evolution - cosmology: observations - 
submillimetre. 
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1 INTRODUCTION 

So far, surveys at sub-millimeter (sub-mm) wavelengths 
have covered only tiny fractions of the sky (less than 
ldeg 2 ) down to mjy levels (Mortier et al. 2005, and ref- 
erences therein). Forthcoming experiments, however, will 
enormously extend the sky coverage. The Legacy Surveys 
with SCUBA-a3 include a Shallow Survey (SASSy) of ~ 
20,000deg 2 to a depth of 150 mjy (5a) and a Wide- Area 
Extragalactic Surve>0 of 5 deg 2 to a 5a flux limit of 2 mjy, 



both at 850^m. The SPIRE instrument on ESA's Her- 
SCHEL satellite may be able to carry out a shallow survey 
of ~ 400 deg 2 to a 100 mjy level (5a) in all three bands 
(250, 350, and 500^m; Lagache et al. 2003; Harwit 2004). 
The HFI instrument (Lamarre et al. 2003) on the Planck 
satellite will carry out all sky surveys in 6 bands, centered 
at 3000, 2100, 1380, 850, 550, and 350 pm; these surveys 
will be confusion limited to flux levels ranging from a few 
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2 After this paper was submitted, we learned that the 
Wide-Area Extragalactic Survey at 850/im has been 
combined with the Deep 450^tm survey. The joint pro- 



gramme, known as the SCUBA-2 Cosmology Legacy Survey 
(www.strw.leidenuniv.nl/~pvdwcrf/pdf/SCUBA2-CLS.pdf), has 
two elements: an 850^tm survey of ~ 35 deg 2 to a 5cr limit of 3.5 
mjy and a deeper 450/nn survey of ~ 1.5 deg 2 to a 5<r limit of 
2.5 mjy. 
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hundred mjy to ~ 1.5 Jy (Vielva et al. 2003; Negrello et al. 
2004; Lopez-Caniego et al. 2006). 

As first pointed out by Blain (1996), the sub-mm region 
is exceptionally well suited to exploit strong gravitational 
lensing to detect large complete samples of high-redshift 
dusty galaxies. The reason resides in the combined effect of 
the large and negative K-correction, that makes the observed 
flux density of sources of given luminosity only weakly de- 
pendent on distance over a broad redshift range above z £ 1 

(Blain & Longair 1993), and of very steep evolution. These 
two ingredients cooperate in yielding large lensing optical 
depths and extremely steep counts, resulting in a strong 
magnification bias. 

Strong lensing is a powerful tool for many interesting 
applications (see Kochanek 2004 for a comprehensive re- 
view). The statistics of lenses is a test of the cosmologi- 
cal model because the optical depth is roughly proportional 
to the volume to the source. If the redshifts of both the 
source and the lens are measured, the ratio between the 
lens-source and observer-source angular diameter distances, 
dia/dos, constitutes a sensitive probe of the geometry of the 
Universe (and in particular of ft a, with weak dependence 
on fl m ) through the scaling of light-ray deflections with the 
source redshift. To date, in the best studied cases, the lens 
is a low redshift cluster such as A1689 or A2218, and this 
test is difficult to apply because, for high redshift sources, 
di s /d os is always ~ 1 and the effect of different cosmologies 
is very small. On the contrary, in the case of the sub-mm 
selected sources, the lenses are expected to be generally at 
substantial redshifts. The distribution of lens redshifts pro- 
vides a further, potentially powerful, cosmological test since, 
in general, cosmologies with a large cosmological constant 
predict significantly higher lens redshifts than those with- 
out. Recent analyses of lens statistics have focussed on the 
CLASS survey (see e.g., Mitchell et al. 2005) to derive con- 
straints on cosmological parameters. Attempts to use the 
lens statistics to constrain dark energy have been worked 
out by Chae et al. (2004) and Kuhlen et al. (2004). 

On the other hand, the probability that a source has 
an intervening lens also depends on the distribution of the 
lens galaxies, and the cross section of the lens for producing 
a given effect depends on its velocity dispersion. Thus, if a 
cosmological model is adopted, the lens statistics is informa- 
tive on the evolution of galaxy properties, and in particular 
of dark matter structures. Note that gravitational lenses are 
unique in providing a selection based on mass, rather than 
luminosity, so that the separation distribution of lenses is a 
direct mapping of the mass function of all halos, luminous 
or dark. 

Alexander et al. (2003, 2005) have found that a sub- 
stantial fraction of sub-mm bright galaxies harbor an ac- 
tive galactic nucleus (AGN). The magnification produced 
by gravitational lensing allows us to study fainter AGN host 
galaxies than otherwise possible. This will permit a better 
understanding of the growth of super-massive black holes 
and their relationships with their host galaxies. 

Another important driver of large-area sub-mm surveys 
is the investigation of the clustering properties of high-z 
galaxies, both in the 1-halo and in the 2-halo regimes, that 
can be performed more effectively than in other wavebands, 
and without being affected by obscuration effects. The large 



scale angular correlation function of detected sources and 
the autocorrelation function of intensity fluctuations pro- 
vide information on the masses of dark matter halos, on the 
duration of the star-formation process and on the evolution 
of the bias factor. They are also sensitive to, and will provide 
constraints on cosmological parameters. 

Furthermore, these large area surveys will yield com- 
plete samples of sub-mm selected low-z dusty galaxies, al- 
lowing, for the first time, direct estimates of the local lumi- 
nosity functions. In addition, they may contain a number of 
extreme luminosity distant galaxies. 

Another interesting, although only seldom mentioned, 
product of large-area sub-mm surveys are complete samples 
of radio sources, selected in a spectral region almost unex- 
plored so far, but where key information on their physics 
is expected to show up. For example, observations at these 
wavelengths may reveal the transition from optically thick 
to optically thin synchrotron emission in the most compact 
regions, allowing the determination of the self-absorption 
(turnover) frequency which carries information on physical 
parameters. Also, the slope of the optically thin synchrotron 
emission steepens at high frequencies due to electron en- 
ergy losses; the spectral break frequency v b is related to 
the strength of the magnetic field and to the 'synchrotron 
age' t a by v b ~ 320(20piG/B) 3 (t s /Myr)~ 2 GHz. Excess far- 
IR/sub-mm emission possibly due to dust, is often observed 
from radio galaxies (Knapp & Patten 1991). These surveys 
will allow us to assess whether this is a general property of 
these sources, with interesting implications for the presence 
of dust in the host galaxies, generally identified with giant 
ellipticals, usually devoid of interstellar matter. 

The dominant radio source population at the relevant 
frequencies is made of flat-spectrum radio quasars and of BL 
Lac objects, collectively dubbed 'blazars'. Multifrequency 
surveys in the far-IR/sub-mm region will allow us to check, 
e.g., whether there are systematic differences in the turnover 
frequencies between BL Lacs and flat-spectrum quasars as 
expected if the BL Lac emission is angled closer to the line 
of sight, so that the turnovers are Doppler boosted to higher 
frequencies. 

Correlations between turnover frequency and luminos- 
ity, which is also boosted by relativistic beaming effects, may 
be expected. On the other hand, Fossati et al. (1998) and 
Ghisellini et al. (1998) found an anti-correlation between 
the frequency of the synchrotron peak of blazars and their 
radio luminosity, referred to as 'the blazar sequence'. The 
spectral energy distributions of the most luminous blazars 
appear to peak at (rest-frame) mm wavelengths, while the 
peak moves towards optical wavelengths as the luminos- 
ity decreases. This sequence, if confirmed, is an important 
breakthrough in our understanding of the physical processes 
governing the blazar emission. On the other hand, Anton & 
Browne (2005) find that most of their low-luminosity flat- 
spectrum radio sources peak at much lower frequencies than 
expected from the Fossati et al. (1998) relationship. Also, 
Nieppola et al. (2006) did not find any correlation between 
luminosity and frequency of the synchrotron peak for their 
sample of 300 blazars. However, only a limited region of the 
luminosity-peak frequency plane has been explored to date, 
mostly due to the dearth of data in the far-IR to mm region, 
and the samples have been collected from different surveys 
and are not complete. 
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Figure 1. Comparison of the redshift distribution of sources with 
Sfeoum > 5 mjy yielded by the Granato et al. (2004) model 
(solid histogram) with the observational estimate by Chapman 
et al. (2005; dot-dashed). The numbers of sources per bin of 
Chapman et al. (2005) have been rescaled to a surface density 
N(Sg 5 0vm > 5mJy) = 620deg~ 2 (Scott et al. 2002). As pointed 
out by Chapman et al. (2005), the dip at z ~ 1.5 and the high-z 
cut-off in the data are likely due to the radio and optical selec- 
tions. 



In this paper we address all these important aspects. 
After an overview of the expected counts of extragalactic 
sources at sub-mm wavelengths (§2), we present a compara- 
tive study of the potential of these surveys in relation to: the 
selection of strongly lensed, dusty primordial galaxies (§3), 
the clustering properties of sub-mm sources at high redshifts 
(§ 4), and the selection of the first complete sub-mm samples 
of local star-forming galaxies and of radio sources, mainly 
blazars (§5). In § 6 we summarize and discuss our main con- 
clusions. This investigation may help in guiding us on what 
to expect and in optimizing the survey programmes. 

Throughout this paper we will adopt the "concordance 
model", i.e. a flat ACDM cosmology with fl m = 0.27, Oj, = 
0.045, and Ov = 0.73, H = 72 km s" 1 Mpc" 1 , <r 8 = 0.8, 
and an index n = 1.0 for the power spectrum of primordial 
density fluctuations. 



2 OVERVIEW OF THE MODELS AND 
SUB-MM COUNTS 

Sub-millimeter extragalactic sources are a mixed bag of var- 
ious populations of dusty galaxies and of flat-spectrum radio 
sources. A summary of the models adopted for each popu- 
lation follows. 



2.1 Proto-spheroidal galaxies 

Dusty galaxies detected at mjy levels by SCUBA and 
MAMBO surveys are modelled following Granato et al. 
(2004), who interpret them as proto-spheroidal galaxies in 
the process of forming most of their stars in a gigantic star- 
burst lasting 0.3 - 1 Gyr (depending on redshift and on the 
halo mass). This physical model predicts the birth rate of 
galaxies as a function of cosmic time and the evolution with 
galactic age of their full spectral energy distribution (SED) , 
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Figure 2. Model 160 fim integral counts compared with the 
Spitzer counts by Dole et al. (2004; squares) and Frayer et al. 
(2006a; triangles). The heavy solid line on the lower left-hand 
corner shows the count estimate by Serjeant & Harrison (2005). 



from radio to X-ray wavelengths, including the effect of the 
active nucleus, whose growth is closely tied to the evolution 
of the host galaxy. It is thus strongly constrained by, and 
indeed accounts for huge amounts of multifrequency data at 
different redshifts (Silva et al. 2005; Cirasuolo et al. 2005; 
Shankar et al. 2006; Granato et al. 2006; Lapi et al. 2006). 
In particular, it fits the 850 /im counts available at the time 
(Granato et al. 2004) and the data on the redshift distribu- 
tion (Fig. [[]). 

However, the re-analysis of existing blank-field SCUBA 
surveys by Scott et al. (2006) and the results of the SCUBA 
Half-Degree Extragalactic Survey (Coppin et al. 2006) con- 
verge in yielding lower counts than previous studies. To 
match these more recent results, we have scaled the 850 /im 
fluxes of Granato et al. (2004) by a factor of 0.68, thus pre- 
serving the agreement with the shape of the redshift distri- 
bution. 

Most recently, Aretxaga et al. (2007) compared the red- 
shift distribution of the SCUBA Half Degree Extragalactic 
Survey (SHADES), derived from rest-frame radio-mm-far 
IR colours, with those yielded by the four different galaxy 
formation models studied by van Kampen et al. (2005) and 
with the semi-analytic models by Baugh et al. (2005) and by 
Granato et al. (2004). Only the latter model turned out to 
be compatible, within the uncertainties, with the data, al- 
though if sources detected only at 850 /im are introduced in 
the redshift probability distribution with other priors, two 
of the models considered by van Kampen et al. (2005) can 
come closer to acceptable. 



2.2 Starburst and normal disk galaxies 

The Granato et al. (2004) model is meant to take into ac- 
count the star formation occurring within galactic dark- 
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Figure 3. Predicted counts of high-z proto-sphcroidal galaxies (solid line), late-type (starburst plus normal spiral) galaxies (dot-dashed 
line), and radio sources (dotted line). The dashed line shows the predicted counts of strongly lensed proto-sphcroidal galaxies. The 850 fim 
data are from Coppin et al. (2006; circles) and Scott ct al. (2006; squares). The heavy solid line on the lower right-hand corner of each 
panel shows the count estimates by Serjeant & Harrison (2005). 



matter halos virialized at z v i r £1.5 and bigger than M v ; r ~ 

10 11,6 Mq, which are, crudely, associated to spheroidal galax- 
ies. We envisage disks (and irregulars) as associated primar- 
ily to halos virializing at z v i r £ 1.5, which have incorporated, 

through merging processes, a large fraction of halos less mas- 
sive than 4 x 10 11 Mq virializing at earlier times, which may 
become the bulges of late type galaxies. 

The contributions of the latter galaxies to the number 
counts are estimated following, as usual, a phenomenolog- 
ical approach, which consists in simple analytic recipes to 
evolve their local luminosity functions (LFs) , and appropri- 
ate templates for their SEDs to compute K-corrections and 
to extrapolate the models to different wavelengths where di- 
rect determinations of the local luminosity functions are not 
available. We have adopted the 60 pm local LFs by Saunders 
et al. (1990), that have been derived for "warm" and "cold" 



galaxies (based on IRAS colors) that we associate, respec- 
tively, to starburst and spiral galaxies. Transformations to 
longer wavelengths were done using the mean flux ratios de- 
termined by Serjeant & Harrison (2005) who extrapolated 
the IRAS detections using the sub-mm colour temperature 
relations from the SCUBA Local Universe Survey (Dunne 
et al. 2000; Dunne & Eales 2001). 

Rather detailed information on dusty galaxies at red- 
shifts ^ 1.5 came from the ISO CAM cosmological surveys 
performed with the LW3 filter, centered at 15 /»m (see La- 
gache et al. 2005 for a review). Various phenomenological 
models proposed to account for these data have been com- 
pared and discussed by Gruppioni et al. (2002), who showed 
that the data require a combination of luminosity and den- 
sity evolution for starburst galaxies, stopping at some red- 
shift 2t>reak ~ 1. Gruppioni et al. (2002) also allowed for 
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non-evolving normal spirals and for type 1 active galactic 
nuclei evolving in luminosity. The latter population, how- 
ever, yields a very small contribution to the 15 /jm counts 
(as well as to the sub-mm counts) and can be neglected 
here. On the other hand, chemo/spectrophotometric evolu- 
tion models for normal disc galaxies (Mazzei et al. 1992; 
Matteucci 2006) indicate a mild increase of the star for- 
mation rate, hence of IR luminosity, with look-back time. In 
view of that, Silva et al. (2004) updated the Gruppioni et al. 
(2002) model by allowing for a moderate luminosity evolu- 
tion of normal spiral galaxies. They obtained good fits to the 
observed 15 /im counts and redshift distributions with the 
following recipe: i) the LF, $ s b, of the starburst population 
evolves both in density and in luminosity as & s h[L(z), z] — 
$ Eb [L(z)/(l + z) 2 - 5 ,z = 0] x (1 + z) 3 ' 5 , with a high-luminosity 
cutoff of the local LF at Leo^m = 2 x 10 32 erg/s/Hz; ii) 



0.15 



spirals evolve in luminosity as L(z) = 1/(0) (1 + 



\1.5. 



iii) 



the evolution of both starburst and spiral galaxies stops at 
Zbrcak = 1 and the LFs keep constant afterwards, up to 
a redshift cut-off z cuto B = 1-5, above which essentially all 
the massive halos are associated, according to Granato et 
al. (2004), with spheroidal galaxies. We have adopted this 
model. 

As shown by Silva et al. (2004, 2005), the model ac- 
counts for a broad variety of data, including optical, ra- 
dio, near-, mid-, and far-IR counts, and redshift distri- 
butions. Moreover the model predicted, at the flux limit 
(fWm ^ 83 /iJy) of the Chandra Deep Field South MIPS 
survey (Papovich et al. 2004), a surface density of sources 
at z ^ 1.5 of ~ larcmin 2 , i.e. amounting to ~ 22% of the 
observed total surface density. This prediction was at vari- 
ance with those of major phenomenological models which, 
for that flux limit, yielded (see Fig. 2 of Perez-Gonzalez et 
al. 2005 and Fig. 6 of Caputi et al. 2006) either very few 
(Chary et al. 2004) or almost 50% (Lagache et al. 2004) 
sources at z ~ 1.5. The redshift distribution observation- 
ally determined by Perez- Gonzalez et al. (2005) and Caputi 
et al. (2006) for F 24fl m ^ 83 /^Jy has 24-28% of sources at 
z ^ 1.5, in nice agreement with the Silva et al. (2004) model. 
We add here (Fig. a comparison with Spitzer counts at 
160 ^m (Dole et al. 2004; Frayer et al. 2006a), the wave- 
length closest to those relevant here, confirming once more 
the consistency of the model with the data. 



2.3 Extragalactic radio sources 

For radio sources we have adopted the model by De Zotti 
et al. (2005). In the wavelength range and at the relatively 
bright flux densities considered in this paper, the dominant 
population are blazars (flat-spectrum radio quasars and BL 
Lac objects). The usual power-law approximation of the 
spectra of these objects may break down at sub-mm wave- 
lengths, where the synchrotron emission of even the most 
compact regions turns from the optically thick to the opti- 
cally thin regime and is further steepened by electron ageing 
effects. To allow for the corresponding steepening of radio 
spectra, we have adopted the spectral shape given by Donato 
et al. (2001) and Costamante & Ghisellini (2002), including 
their (controversial, see Anton & Browne 2005 and Niep- 
pola et al. 2006) anti-correlation between break frequency 
and radio luminosity (see §[5]). 
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Figure 4. Redshift distribution of spiral (dotted) and starburst 
(dashed) galaxies brighter than 100 mjy at 550 fim, for Az = 
0.01, according to the model described in the text. 



2.4 Sub-mm counts 

Figure [3] summarizes the expected contributions of proto- 
spheroids, of late-type (starburst plus spiral) galaxies, and 
of radio sources to the counts at 850, 550, 350, and 250 /im, 
yielded by the models described above. The 850 /im panel 
shows that the SCUBA counts are almost entirely accounted 
for by proto-spheroidal galaxies, with a minor contribution 
from starburst galaxies, which, however, become increas- 
ingly important at low flux densities. At bright fluxes, the 
counts of dusty galaxies match the extrapolation of IRAS 
data by Serjeant & Harrison (2005). At the brightest lev- 
els, however, the counts may be dominated by flat-spectrum 
radio sources. 



3 STRONGLY LENSED PROTO-SPHEROIDAL 
GALAXIES 

As already mentioned, the combination of the very steep 
counts (Fig. [3]) and high redshifts (Fig. [IJ, hence large lens- 
ing optical depths, of sources detected by SCUBA surveys 
maximizes the fraction of strongly lensed sources. 

Calculations have been carried out as in Perrotta et al. 
(2002, 2003), but with the evolutionary model for proto- 
spheroidal galaxies by Granato et al. (2004), updated as 
described above. The predicted counts of strongly lensed 
sources (defined as those with a gravitational amplification 
by a factor ^ 2) are displayed in Fig. We find that the frac- 
tion of strongly lensed proto-spheroids can indeed be very 
high at relatively bright sub-mm fluxes, where the surface 
density of unlensed proto-spheroidal galaxies sinks down 
rapidly. Dunlop et al. (2004) argue that the amplification 
distribution derived by Perrotta et al. (2003) is probably 
low because it neglects the effect of the halo substructure. If 
so, the fraction of strongly lensed sources in sub-mm surveys 
is even higher than shown in Fig. [3] 
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Figure 5. Predicted redshift distributions in bins of Az = 0.36 of strongly lensed galaxies brighter than 100 mJy in the 3 Her- 
SCHEl/SPIRE channels (250, 350, and 550 /im) over an area of 400 deg 2 , and brighter that 150 mJy (the flux limit of the SCUBA-2 
SASSy survey) at 850 ^tm over an area of 20, 000 deg 2 , based on the Granato et al. (2004) model, updated as described in the text. 



Clearly, large area sub-mm surveys can provide far 
richer samples of strongly lensed sources than currently 
available. However, Planck/HFI is confusion limited at 
bright flux levels. Estimates of the 5a detection limits have 
been obtained by Lopez-Caniego et al. (2006), using up-to- 
date templates of the microwave/sub-mm sky, but ignoring 
source clustering. The effect of latter on the confusion noise 
has been discussed by Negrello et al. (2004), using models 
fitting the earlier 850 ^m counts. We have revised down- 
wards the estimates by Negrello et al. (2004) based on the 
counts shown in Fig. [3] Combining these results with those 
by Lopez-Caniego et al. (2006) we obtain 5a detection lim- 
its of ~ 335, 740, and 1290 mJy at 850, 550, and 350 pm, 
respectively. Note that the analysis by Lopez-Caniego et al. 
(2006) refers to high Galactic latitude regions (|6| > 30°); 
at lower latitudes the detection limits increase because of 
the larger contributions of Galactic emissions to foreground 
fluctuations. Therefore, Planck/HFI can possibly pick up 
only the most extreme (very rare) high-z sources. 



The situation is far better for Herschel/SPIRE. For 
example, the surface density of strongly lensed galaxies at 
a flux limit of 100 mJy is estimated to be ~ 0.14 deg~ 2 at 
500 fim. Thus a survey of, say, 400 deg 2 to this flux limit will 
provide a complete sample of ~ 60 strongly lensed galax- 
ies. The SASSy will provide information on strongly lensed 
sources several times brighter (in terms of bolometric lu- 
minosity) and much rarer (surface density ~ 0.002 deg -2 ), 
which however will be detected in a comparable number 
(~ 40) thanks to the much larger area (20, 000 deg 2 ). For 
comparison, the radio Cosmic Lens All-Sky Survey (CLASS; 
Browne et al. 2003; Myers et al. 2003), that has been the ba- 
sis for most analyses of lens statistics, has yielded a sample 
of 22 gravitational lens systems. 

The other source populations contributing to the counts 
are easily distinguished from (lensed and unlensed) proto- 
spheroidal galaxies. Essentially all radio sources are flat- 
spectrum blazars, already detected by low-frequency sur- 
veys, like the 1.4 GHz NVSS (Condon et al. 1998) and 
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FIRST (Becker et al. 1995; White et al. 1997), complete 
to a few mjy levels, the 0.84 GHz SUMSS (Mauch et al. 
2003), with completeness limits ranging from 8 to 18 mjy, 
and the 5 GHz GB6 survey (Gregory et al. 1986), complete 
to 18 mjy, and PMN survey (Griffith & Wright 2003) with 
completeness limits ranging from 20 to 72 mjy. Altogether 
these surveys cover the full sky. The median spectral index 
of blazars between 5 and 18.5 GHz is (Ricci et al. 2006) 
a ~ 0.16 (S„ oc v~ a ) and generally further steepens at 
higher frequencies. Therefore the radio flux of blazars is sig- 
nificantly higher than at 500 /im, and thus well above the 
completeness limits of the quoted surveys; obviously, the low 
frequency fluxes of steep-spectrum sources are even higher. 

The starburst and normal late-type galaxies sampled 
at the relatively bright flux limits considered here are 
at z <C 1 (see Fig. [4}. For the median local ratios 

5*60^111 /S250//m — 1, 5'60Aim/>S'350/jm — 2.4, SeO^m/Ssso^m — 9, 

and Seoum/ SsBOfim — 41 (Serjeant & Harrison 2005, based 
upon observational data by Dunne & Eales 2001) these 
sources have 60 /an fluxes above the completeness limit 
of the IRAS survey, Seo/jm — 0.6 Jy, for A > 500/mi, 
and well above that of the AKARI (formerly ASTRO- 
F) all-sky survey, Seo/j.m — 44mJy (Pearson et al. 2004), 
at all wavelengths. Combining SPIRE and/or SCUBA-2 
with IRAS and AKARI data, photometric redshift esti- 
mates can be obtained. On the contrary, lensed and un- 
lensed proto-spheroidal galaxies, at typical z ~ 2-3, have 

<S , 60fim/S'550 ( im <IC 1. 

Also, starburst and normal late-type galaxies are ex- 
pected to have optical counterparts in the available digitized 
sky surveys such as SuperCOSMOS, APM, APS, DSS, and 
PMM. In fact only 3.5% of the galaxies the IRAS PSOz sam- 
ple (Saunders et al. 2000), complete to S^o^m = 0.6 mjy over 
84% of the sky had no identifications with bj ^ 19.5 mag. 
On the other hand, star-forming proto-spheroidal galaxies 
are expected to be optically very faint (R > 25) because 
of their substantial redshifts bringing optical bands to the 
rest-frame UV where obscuration is very high. 

At 500 and 850 /im, unlensed proto-spheroids have es- 
sentially disappeared above 100 mjy so that, after having 
removed radio sources and low- z galaxies as discussed above, 
we are left with a sample almost exclusively made of strongly 
lensed sources, i.e. we expect an efficiency in the selection 
of such sources close to 100%. For comparison, the CLASS 
survey found 22 gravitational lens systems out of over 16000 
radio sources. 

The surface density of strongly lensed sources brighter 
than 100 mjy in the two other Herschel/SPIRE channels 
is higher than at 500 /im (~ 0.6 deg -2 at both 250 /im and 
at 350 /im, corresponding to ~ 240 strongly lensed galax- 
ies over 400 deg 2 ). It is however lower than that of unlensed 
proto-spheroids, so that singling out strongly lensed sources 
is not straightforward, although at 350 /im the ratio of lensed 
to unlensed sources is still high (~ 0.4). Also, only a fraction 
of starburst /late type galaxies selected at these wavelengths 
are expected to show up in the IRAS catalog. On the other 
hand, the different selection wavelengths provide comple- 
mentary information, since, as illustrated by Fig. \5\ they 
probe different redshift intervals, systematically shifted to 
higher values with increasing wavelength. 

The SCUBA-2 "Wide area extragalactic survey" should 
also detect hundreds of strongly lensed sources. In this case 
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Figure 6. Comparison of the angular correlation function of 
SHADES sources predicted by the model adopted in this paper 
(solid line with lrr error bars) with the best fitting power laws of 
the sky-averaged correlation functions predicted by the 4 models 
studied by van Kampen et al. (2005). 

however, they are only ~ 2% of the total number of high- 2 
galaxies, and detailed follow up observations would be nec- 
essary to single them out. Herschel/SPIRE surveys going 
substantially deeper than 100 mjy will face a similar prob- 
lem. 

Clearly, the angular resolution of both Her- 
SCHel/SPIRE and SCUBA-2 is far too poor to resolve 
the multiple images of lensed sources, so that optical/IR 
follow-up is necessary to see the multiple images. Note 
that the sources considered here are brighter, even by large 
factors, than the two highly magnified sub-mm sources 
discovered by Kneib et al. (2004) and Borys et al. (2004), 
so that the follow-up measurements will be correspondingly 
easier. 



4 CLUSTERING OF PROTO-SPHEROIDAL 
GALAXIES 

Both theoretical arguments and observational data indicate 
that the positions of powerful far-IR galaxies, detected by 
SCUBA surveys and interpreted as massive proto-spheroidal 
galaxies, are highly correlated (see e.g. Smail et al. 2003; 
Negrello et al. 2004; Blain et al. 2004; Scott et al. 2006). 
On the contrary, Poisson fluctuations dominate in the case 
of the other extragalactic source populations contributing to 
the sub-mm counts, whose clustering is relatively weak, as in 
the case of spiral and starburst galaxies (cf. e.g. Madgwick et 
al. 2003) or highly diluted because of the very broad redshift 
distribution, as in the case of radio sources (Magliocchetti et 
al. 1999; Blake & Wall 2002a,b; Overzier et al. 2003; Blake 
et al. 2004; Negrello et al. 2006). 

In the following we will therefore consider only the 
clustering of dusty proto-spheroidal galaxies. As already 
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mentioned, they are located at substantial ( £ 2) redshifts 

(Chapman et al. 2005) and display huge star formation rates 
( Si 1000 M Q /yr), allowing masses in stars of the order of 

10 11 Mq to be assembled in times shorter than 1 Gyr. 

For dark matter to stellar mass ratios typical of massive 
ellipticals (see, e.g., Marinoni & Hudson 2002; McKay et al. 
2002), the dark matter haloes in which the sub-millimeter 
galaxies reside have masses of > 10 13 M . Since such mas- 
sive haloes sample the rare high-density peaks of the pri- 
mordial dark matter distribution (Kaiser 1984; Mo & White 
1996), these sources are expected to exhibit a strong spa- 
tial clustering, similar to that measured for Extremely Red 
Objects (EROs; Daddi et al. 2001, 2003), for which similar 
masses have been inferred (Moustakas & Somerville 2002). 

Direct measurements of clustering properties of SCUBA 
galaxies are made difficult by the poor statistics and by the 
fact that they are spread over a wide redshift range, so that 
their clustering signal is strongly diluted. However, tentative 
evidences of strong clustering with a comoving correlation 
length ro ~ 8-13 h~ Mpc, consistent with that of EROs, 
have been reported (Scott et al. 2002, 2006; Webb et al. 
2003; Smail et al. 2003; Blain et al. 2005). A highly statisti- 
cally significant detection of strong clustering for ultralumi- 
nous infrared galaxies over 1.5 < z < 3 has been obtained 
by Farrah et al. (2006a,b) from an analysis of optically faint 
sources selected at 24^im in three fields observed as a part of 
the Spitzer Wide-area Infrared Extragalactic (SWIRE) sur- 
vey. They found r = 14.40 ± 1.99 h" 1 Mpc for 2 < z < 3 
and r = 9.40±2.24h~ 1 Mpc for 1.5 < z < 2. Magliocchetti 
et al. (2006) found that optically unseen (R > 25.5) 24/im 
selected sources in the complete Spitzer First Look Survey 
sample (Fadda et al. 2006) are highly clustered and pointed 
out several lines of circumstantial evidence consistently indi- 
cating that they are at z ~ 2. If so, their comoving clustering 
length is r = 10.6tj j IT 1 Mpc. 



4.1 Formalism 

We model the two-point spatial correlation function, £(r, z), 
of star-forming spheroids as in Negrello et al. (2004; their 
model 1): 



£(r, z) = b 2 (M aH ,z)£ DM (r, z), 



(1) 



where b(M c s, z) is the redshift-dependent (linear) bias factor 
(Sheth & Tormen 1999; Percival et al. 2003), M cS being the 
effective mass of the dark matter halos in which the sources 
reside; £dm is the linear two-point spatial correlation func- 
tion of dark matter, determined by the power spectrum of 
primordial density perturbations as well as by the under- 
lying cosmology. For the power spectrum of the primordial 
fluctuations we adopt the fitting relations by Eisenstein & 
Hu (1998) which account for the effect of baryons on the 
matter transfer function. 

We adopt an effective halo mass of star-forming 
spheroids M e ff = 10 13 h _1 Mq, which yields a comoving 
clustering length, defined by £(ro,z) — 1, ro ~ 8h _1 Mpc, 
consistent with the available observational indications for 
SCUBA galaxies (see Negrello et al. 2004), and somewhat 
below the estimates for Spitzer 24 /im sources at z ~ 2. Note 
that, on scales below a few Mpc, clustering enters the non- 
linear regime and the slope of £ (r, z) becomes steeper than 



expected from the linear theory. In this range of scales, a 
power-law model provides a better description of the two- 
point spatial correlation function. Therefore we assume 

£(r,z) = [r/roiz)]' 1 - 8 for r < 2r vir (M cff , z) ~ 2 Mpc, (2) 

r v j r being the comoving virial radius of the characteris- 
tic halo in which star-forming spheroids reside. In view of 
the tight connection between spheroidal galaxies and ac- 
tive galactic nuclei at their centers entailed by the model 
of Granato et al. (2004), we assume the correlation length, 
ro, to be constant in comoving coordinates, as suggested by 
quasar data (Porciani et al. 2004; Croom et al. 2005). 

The angular correlation function is related to the spatial 
one, £(r,z), by the relativistic Limber equation (Peebles, 
1980): 



w{9) = 2 



J °° f~F-*(x)xW(x)tir,z)dxdu 
[Jo°° F-i(x)x*<t>(x)dx] 2 



(3) 



where x is the comoving radial coordinate, u is defined by 
r = {u 2 + [x9/(l + z)] 2 } 1 ^ 2 , F(x) gives the correction for 
curvature, and the selection function §(x) satisfies the rela- 
tion 



(4) 



poo -, roc 

JV= $(x)F- 1 (x)x 2 dx= — / N(z)dz, 
Jo S2s Jo 

in which Af is the mean surface density, f2 s is the solid angle 
covered by the survey, and N(z) is the number of sources 
within the shell (z, z + dz), given by the model. 
The errors on w(9) have been computed as: 



Sw{9) 



1 + w(9) 



(5) 



where -/V pa i r is the number of source pairs separated by an 
angular distance 6. 

The angular correlation function of intensity fluctua- 
tions due to sources below the detection limit writes (Peebles 
1980; De Zotti et al. 1996) 



C c {9) 



f 

Jo 



dz b 2 B (M ctt ,z) 



3es(z) ( dx 



d(5z)£(r,z), 



{i + zY 



rdxy 

\~dz~) 



(6) 



Zmax being the redshift when the sources begin to shine, 
z(Sd,L) the redshift at which a source of luminosity L is 
seen with a flux equal to the detection limit Sd, and 5z 
the redshift difference of sources separated by a comoving 
distance r. The effective volume emissivity j e s is expressed 



/•min[L maxL(S(j>i) ] 

j cff = / $(L,z) K(L,z) LdlogL, (7) 

$(!/, z) being the luminosity function (per unit logarithmic 
luminosity interval), K(L, z) the K-correction and L ma x and 
^min the maximum and minimum local luminosity of the 
sources. 

The angular power spectrum of the intensity fluctua- 
tions can then be obtained as 

Ci = <ja?| 2 } = / / C c (9) Pi(cos9) sinfl d8 d</> . (8) 
Jo Jo 
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Figure 7. Predicted angular correlation function, with its uncertainties (3<r Poisson error bars), of proto-spheroids detected by 250 (im 
surveys to flux limits of 25, 60, and 100 mJy, over areas of 10, 50, and 100 deg 2 , respectively, and by a 350 fim survey of 10 deg 2 to a flux 
limit of 25 mJy, averaged over bins Alog(0) = 0.4. The dilution of the signal due to the other, weakly clustered, source populations is 
taken into account. The corresponding predicted numbers of detected sources and their redshift distributions for redshift bins of Az = 0.23 
are also shown. The predicted redshift distributions of spiral and star burst galaxies (dotted and dashed histograms, respectively) are also 
shown. Surveys at 350 [im to flux limits of 60 and 100 mJy over areas of 50 and 400 deg 2 would provide only very weak constraints on 
w(9). Deep surveys, reaching the estimated confusion limits at 250 and 350 /im, can provide useful constraints on cosmological parameters, 
as illustrated by the top left-hand panels where, in addition to the predicted w(8) for our reference cosmological model, with Q m = 0.3 
and SI;, = 0.047 (dashed line), we show the effect of different choices of these 2 parameters: Q m = 0.15 and Qt = 0.045 (dotted line); 
f^m = 0.27 and £7^ = 0. (dot-short dashed line); Q m = 0.27 and Q;, = 0.09 (dot-long dashed line). 



The fractional error on the power spectrum can be estimated 
summing in quadrature the instrumental, confusion and cos- 
mic variances (Knox 1995): 

5^ _ V 2 / 2 \ ^ ( _A^_\ 

O t \AJ \2l+l) \ + NC e W e ) ' ( > 

where A (sr) is the surveyed area, a is the rms noise per 
pixel (quadratic sum of instrumental and confusion noise), 
iV is the number of pixels in the map, and We is the window 



function, describing the region of the ^-space observable with 
the considered instrument. We assume Gaussian beams with 
Full Width at Half Maximum FWHM = 2oW21n2, so that 
Wi = exp(-^ 2 ( 7|). 

In the dependence on the angular scale of both the an- 
gular correlation functions and the power spectra of back- 
ground fluctuations due to clustering, three regimes can be 
distinguished (see Cooray & Sheth 2002 for a review). On 
small scales the power spectrum probes the statistics of the 
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Figure 8. Predicted angular correlation function, with its uncertainties (3<r Poisson error bars), of proto-spheroids detected by the 
850 fim SCUBA-2 survey of 2 deg 2 to a detection limit of 2 mJy, and the corresponding redshift distribution. The dilution of the signal 
due to the other, weakly clustered, source populations is taken into account. The predicted redshift distributions of spiral and starburst 
galaxies (dotted and dashed histograms, respectively) are also shown. We have used bins of Alog(#) = 0.2 and Az = 0.23. The different 
lines in the left-hand panel correspond to the same choices of the cosmological parameters Q m and Q b as in Fig. [7] 



distribution of sources (sub-halos) within large-scale dark 
matter halos i.e. the halo occupation function, determined 
by the non-linear evolution of large-scale structure. On in- 
termediate scales, the amplitude of the signal is mostly de- 
termined by the bias parameter, which in turn depends on 
the effective mass of dark halos associated to sources. On 
very large scales, the signal provides information on the cos- 
mological evolution of primordial density fluctuations in the 
linear phase, and is thus sensitive to cosmological parame- 
ters, and in particular to the dark matter and baryon den- 
sities. 



4.2 Predictions for w(6) and Ci 

Before proceeding to describe our predictions it is interest- 
ing to compare our model with the four alternative mod- 
els for the clustering of SCUBA galaxies investigated by 
van Kampen et al. (2005). These include a simple merger 
model, a hydrodynamical model, the stable clustering model 
by Hughes & Gaztanaga (2000), and a phenomenological 
model. For each of these models van Kampen et al. (2005) 
worked out estimates of the clustering properties, tailored 
for the SHADES survey. In Fig. |S] these estimates are com- 
pared with those of the model used here. There are clear 
differences among the models, so that the analysis of clus- 
tering will help to discriminate between them. The w(9) 
predicted by our model is remarkably close to that of the 
hydrodynamical model, which assumes a redshift distribu- 
tion (the analytical form of Baugh et al. 1996) close to being 
compatible with the results by Aretxaga et al. (2007). 

Our predictions for the angular correlation function, 
w(0), and/or the power spectrum of intensity fluctuations, 
Ci , for the large area surveys defined in the § 1 are presented 
in Figs. CHIO] 

The steep counts of proto-spheroids shown by Fig. [3] 



imply that in the case of Planck/HFI and of an Her- 
SCHEl/SPIRE survey flux limited at 100 mJy at 500 /im 
the clustering signal comes entirely from sources below the 
detection limit. For the same flux limit, we expect signifi- 
cant signals both from sources above and below threshold 
at 350 and 250 /J,m. Only at the latter wavelength, however, 
a 400 deg 2 survey would provide a large enough sample of 
detected spheroidal galaxies allowing a direct study of their 
angular correlation function (see Fig. [7J. A deeper survey 
(flux limit of ~ 25 mJy) on one hand provides a substan- 
tially larger sample in a given observing time (thanks to 
the steepness of the counts of proto-spheroids in this flux 
density interval) but, on the other hand, contains a higher 
fraction of weakly clustered late-type (starburst and spiral) , 
diluting the signal. The former effect, however, prevails, so 
that, as shown by Fig. [7] we expect a rather accurate deter- 
mination of w(8) particularly at 250 /im, where the data will 
effectively probe all the three clustering regimes mentioned 
above. The number of detected sources decreases rapidly, 
for given area and limiting flux, with increasing wavelength. 
Already at 350 /im the uncertainties are rather large. On the 
other hand, studies at longer wavelengths explore higher z's 
(because of the effect of the strongly negative K-correction) . 

The SASSy survey will detect, apart from the strongly 
lensed sources discussed in §[3] mostly blazars and low-z 
late-type/starburst galaxies. The size of the samples is un- 
likely to be sufficient to detect the weak clustering of these 
populations. On the other hand, the SCUBA-2 'Wide-area 
extragalactic Survey' is well suited for the investigation of 
small- (non-linear), intermediate (sensitive to the bias pa- 
rameter), and large-scale (sensitive to cosmological param- 
eters) clustering, since it has a very good statistics and the 
clustering signal is less diluted than in the (already very fit) 
deepest 250 (im and 350 /im Herschel/SPIRE surveys con- 
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Figure 9. Power spectra of background fluctuations due to clustering of proto-spheroidal galaxies below the detection limit compared 
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of Galactic dust emission based on the Finkbeiner et al. (1999) model, averaged over a particularly dust-free 400 deg 2 area centered at 
I ~ 245°, b ~ —50°. The dashed lines show the Poisson contributions of extragalactic point sources below the detection limits. In the 
bottom right-hand panels we show, in addition to the predicted power spectrum for our reference cosmological model, with f2 m = 0.3 
and Q;, = 0.047 (dashed line), the effect of different choices of these 2 parameters: Q m = 0.15 and Qf, = 0.045 (dotted line); fi m = 0.27 
and H;, = 0. (dot-short dashed line); Q m = 0.27 and Slj, = 0.09 (dot-long dashed line). 



sidered in Fig. [7] It also covers a much wider redshift range 
(see Fig.©. 

Herschel/SPIRE surveys at 500 /jm, which are ex- 
pected not to provide large enough samples of individually 
detected high-z proto-spheroids to allow accurate estimates 
of their w(8), are however better suited than surveys at 
shorter wavelengths to estimate the autocorrelation func- 
tion of background fluctuations, from which the clustering 
power spectra can be derived. This is illustrated by Fig. [9] 
where the clustering signal is compared with competing as- 



trophysical sources, namely the Poisson fluctuations due to 
the combination of all populations of extragalactic sources 
fainter than the detection limit, and to Galactic dust emis- 
sion. To deal with the latter we have used the Finkbeiner et 
al. (1999) model, built combining IRAS and COBE data. We 
have assumed that the Herschel survey is carried out in 
the low-dust region centered at I ~ 245°, b ~ —50°. Because 
of the effect of redshift, the clustering power spectrum of 
high-z proto-spheroids dominates over that of dust (in the 
chosen region) over a multipole range increasing with in- 
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creasing wavelength. At 500 fim, the 'clean' range is broad 
enough to provide information both on the bias factor and 
on cosmological parameters in each of the three considered 
cases, but the highest accuracy is achieved by the largest 
area, shallow survey. On the contrary, the possibility of tak- 
ing full advantage of the Herschel/SPIRE resolution to 
learn about the non-linear clustering regime (whose onset is 
marked by the high-multipole inflection point) is seriously 
constrained by Poisson fluctuations. 

While the much larger area covered by Planck allows, 
in principle, a much more accurate estimate of the power 
spectrum of the clustering signal, this advantage is largely 
impaired by the contamination by Galactic dust emission. 
For example, even if we restrict ourselves to the ~ 9650 deg 2 
around the Galactic polar regions (|&| > 50°), the interstellar 
dust emission power spectrum blurs the clustering contribu- 
tion for multipoles £ 500 at 550 /im and for £ 900 at 

350 ftm (Fig. [10]). The effect of Galactic dust decreases sub- 
stantially at 850 fim, but then the power spectrum is dom- 
inated by CMB anisotropics up to I ~ 1000. On the other 
hand, it is possible to select regions of ~ 400 deg 2 where the 
Galactic dust contamination is much lower and below the 
clustering signal down to I 350 at 550 fim and to £ ^ 500 
at 350 fim. Again, at high multipoles (£ Si 2000), Poisson 

fluctuations dominate. An application of eq. <(9j shows that 
the uncertainties on the clustering power spectrum on scales 
much larger that the instrument FWHM are, at given wave- 
length, almost identical for Planck and Herschel surveys 
of the same area. The substantial increase of the error bars 
at £ ~ 2500 signals that we are approaching the limit set by 
Planck's resolution. 



5 OTHER SOURCE POPULATIONS 

5.1 Local late- type and starburst galaxies 

According to the counts shown in Fig. El an Herschel/SPIRE 
survey of 400 deg 2 to a flux limit of 100 mjy will yield ~ 
3000 low-z galaxies at 250 fim, ~ 890 at 350 fim, and ~ 
190 at 500 fim. Low-z galaxies are expected to dominate the 
counts at these bright flux levels, except perhaps at 250 fim. 
In any case, they can be easily distinguished from the other 
populations as described in § 3. The SCUBA-2 SASSy survey 
will yield about 350 low- z galaxies at 850 fim. Adopting the 
5a detection limits estimated in § 3, we find that number 
of low- z galaxies detected by Planck/HFI in the 50% of 
the sky at |6| > 30° will be of 110, 300, and 990 at 850 fim, 
550 fim, and 350 fim, respectively. 

Thus, these surveys will provide large enough samples of 
low- z galaxies to allow the first direct and accurate estimates 
of the local luminosity functions of late-type and starburst 
galaxies at several sub-mm wavelengths. 

5.2 Radio sources 

Based on the current understanding of the blazar spectra, 
the slope, /3, of the integral counts (N(> S) oc S" 13 ) of 
blazars in the relevant flux density range cannot be much dif- 
ferent from that measured by high-frequency radio surveys 
(Waldram et al. 2003; Ricci et al. 2004), i.e. ~ 1. Then, the 
number of detected sources in an observing time t increases 



as t 1 ' 2 if we use the time to go deeper over a fixed area, 
and increases as t if we cover a larger area to a fixed depth. 
Thus, a larger area shallow survey is substantially more pro- 
ductive than a deep survey over a small area. From the De 
Zotti et al. (2005) model we expect that the SASSy sur- 
vey detects ~ 1,200 blazars, and that the Planck surveys 
detect ~ 550, 130, and 50 sources at 850 fim, 550 fim, and 
350 fim, respectively, for the flux limits given in § 3 and at 
|6| > 30°. But since a key scientific goal is the identification 
of the synchrotron peak frequency, a broad spectral cover- 
age is essential and large area surveys at Herschel/SPIRE 
wavelengths would also be extremely valuable. A 400 deg 2 
survey to a flux limit of 100 mjy is expected to provide a 
complete sample of ~ 30 blazars detected at each SPIRE 
wavelength. As explained in § 3, the radio source popula- 
tions can be easily identified by a positional cross correlation 
with radio catalogues, covering the whole sky at at least 2 
frequencies. 



6 DISCUSSION AND CONCLUSIONS 

Combining the predictions of the physical model by Granato 
et al. (2004) for the formation and evolution of spheroidal 
galaxies (updated to take into account the most recent 
850 fim counts) with up-to-date phenomenological models 
for the evolution of starburst and normal late-type galaxies 
and of radio sources, we have worked out quantitative pre- 
dictions for the many important wide-area, shallow surveys 
that will be carried out in the next few years. 

We find that Planck/HFI surveys are confusion lim- 
ited at bright flux levels, with 1 a confusion noise increasing 
from ~ 70 mjy at 850 fim to ~ 260 mjy at 350 fim. These 
surveys will thus be not sufficiently deep to detect with good 
signal-to-noise ratio (S/N > 5) high-z galaxies, except in the 
case of extreme gravitational magnifications or exceptionally 
high luminosities. They will however, provide complete sam- 
ples of hundreds low-z star-forming (starburst and normal 
late-type) galaxies, enabling a direct, accurate, determina- 
tion of their local luminosity functions at sub-mm wave- 
lengths. In addition, they will detect, at least at 850 fim, 
several hundreds blazars, providing important information 
on their physical properties. 

Although the number of sub-mm sources of each pop- 
ulation detected by Planck is too small to allow a mea- 
surement of their angular correlation function (unless their 
clustering is far stronger than indicated by current data) , the 
Planck maps will allow accurate determination of the auto- 
correlation function of intensity fluctuations due to cluster- 
ing of proto-spheroidal galaxies. It will however be difficult 
to take full advantage of Planck's sky coverage because of 
the contamination by Galactic dust. Even at \b\ > 50°, the 
Galactic dust power spectrum is expected to overwhelm that 
of clustering for multipole numbers £ < 1000 in the 350 fim 
channel and for £ ^ 600 in the 500 fim channel (see Fig. I10[) . 
Nevertheless, the useful range of angular scales provides pre- 
cise information on the bias parameter of halos hosting the 
sub-mm sources, hence on their masses. 

There are however smaller low-dust regions where the 
clustering power spectrum may dominate over that of Galac- 
tic dust down to £ ~ 400-500 at 550 fim and at 350 fim, 
respectively. Such regions are large enough to allow us to in- 
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Figure 10. Power spectra of background fluctuations due to clustering of proto-spheroidal galaxies below the detection limit in the high 
frequency Planck/HFI surveys (points with error bars for bins A£ = 0A£) compared with the power spectra of the diffuse Galactic dust 
emission (broken lines), of Poisson fluctuations due to extragalactic point sources below the detection limits (dashed lines), and of the 
CMB (solid line, visible in the 850 fim panel, and in the bottom left-hand corner of the 550 /im panel). The dust power spectra are based 
on the Finkbeiner et al. (1999) model; the upper set of panels refer to the ~ 9650 deg 2 at |b| > 50°, the lower set to the 400 deg 2 low-dust 
region centered at I ~ 245°, b ~ —50°. In the bottom panels we show, in addition to the predicted power spectrum for our reference 
cosmological model, with Q m = 0.3 and Q b = 0.047 (dashed line), the effect of different choices of these 2 parameters: Q m = 0.15 and 
tt b = 0.045 (dotted line); Q, m = 0.27 and Q, b = 0. (dot-short dashed line); Q m = 0.27 and Q, b = 0.09 (dot-long dashed line). 



vestigate the regime where the shape of the clustering power 
spectrum is sensitive to the cosmological parameters (espe- 
cially to the baryon density and to the matter density), so 
that its accurate determination sets interesting constraints 
on them. At 850 /im, the dominant signal up to I ~ 1000 is 
the Cosmic Microwave Background (CMB). For high mul- 
tipoles, I > 2500, Planck's ability to measure the power 

spectrum hits the limit set by angular resolution; however, 
already at somewhat lower multipoles (£ ~ 2000) Poisson 
fluctuations overcome the clustering ones. 



One might expect that, thanks to its much higher 
angular resolution (compared to Planck/HFI) Her- 
SCHEl/ SPIRE can remove much more efficiently point 
sources and thus substantially decrease the Poisson fluctu- 
ation level. However, while it is obviously true that Her- 
SCHEl/SPIRE can reach much fainter detection limits, the 
very steep sub-mm counts of proto-spheroidal and starburst 
galaxies imply that the dominant contribution to Poisson 
fluctuations come from the still fainter fluxes where the 
slope of integral counts flattens below the critical value 
P = 2. Thus while, in principle, the Herschel/SPIRE res- 
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olution would allow us to reach very high multipoles (up to 
I ~ 2 x 10 4 ), the Poisson fluctuations are expected to dom- 
inate for x 10 3 , i.e. over the multipole range where 
1-halo contributions (i.e. the clustering of sources within a 
single dark matter halo) come out, even for surveys reach- 
ing the confusion limits. On the whole, we do not expect 
that Herschel/SPIRE surveys will add much to clustering 
power spectrum determinations provided by Planck/HFI, 
except for the HERSCHEL-only 250 /im channel. They will 
however provide independent estimates. 

The great advantage of Herschel/SPIRE over 
Planck/HFI resides in its capability of detecting large 
enough source samples to allow a direct determination of 
the angular correlation function w(9), at least at 250pm 
and at 350 /im (Fig. ■ In particular, a 250 /mi survey of 
10 deg 2 reaching a flux limit of 25 mjy effectively probes 
all the three clustering regimes mentioned in ij l4.ll As illus- 
trated by Fig.[U] measurements of the w(0) are also useful 
to discriminate between galaxy formation models (van Kam- 
pen et al. 2005; Percival et al. 2003), although the error bars 
are expected to be rather large. Surveys of different depths 
probe different redshift intervals. 

At 350 and 500 /im, Herschel/SPIRE surveys cover- 
ing ~ 400 deg 2 are necessary to extend the counts up to flux 
density levels sampled by Planck, which, at these wave- 
lengths, is confusion limited at ~ 1 Jy. A survey of a similar 
area, to a 5<r detection limit ~ 100 mjy, would also provide 
sufficient samples of z 0.1 late-type/starburst galaxies to 

estimate their local luminosity functions. 

At 500 /mi, such a very large area, shallow survey will 
also be ideal to select ~ 60 strongly lensed high-z dusty 
galaxies with an essentially 100% efficiency. As discussed in 
§ 3, the other source populations can be easily distinguished 
using only available photometric data. The simultaneous 
SPIRE surveys at shorter wavelengths with a similar de- 
tection limit will detect larger numbers of strongly lensed 
sources (e.g. we expect ~ 240 strongly lensed galaxies at 
350 /im), but singling them out will be a less easy and ef- 
ficient process, although at 350 /im the ratio of lensed to 
unlensed proto-spheroidal galaxies is estimated to be still 
high (~ 0.4). The redshift distributions of lensed sources is 
increasingly shifted to higher redshifts with increasing sur- 
vey wavelength. 

We have also estimated that the SASSy survey will de- 
tect ~ 40 strongly lensed sources, substantially brighter (in 
terms of bolometric luminosity) and with more extreme am- 
plifications than those detected by Herschel/SPIRE sur- 
veys with a flux limit of ~ 100 mjy. These sources will be 
retrievable with an essentially 100% efficiency, as in the case 
of the latter surveys. 

On the other hand, in spite of its very large area, 
the SASSy survey will probably not detect enough proto- 
spheroidal galaxies to obtain an estimate of their w(8). The 
SCUBA-2 Wide-area Extragalactic survey will be much bet- 
ter suited for this purpose and may indeed yield more ac- 
curate information on all three clustering regimes than the 
other surveys discussed here. It will also detect hundreds of 
strongly lensed proto-spheroids; these however will be out- 
numbered by a factor ~ 100 by unlensed proto-spheroids, so 
that singling them out will require a toilsome effort. 

The experience made in different contexts (e.g. analy- 



ses of Spitzer low-frequency data - Frayer et al. 2006a, b - 
or of WMAP data - Lopez-Caniego et al. 2007) has demon- 
strated that source detection algorithms, when applied to 
low-resolution maps, easily miss real sources or attribute to 
them wrong positions or misinterpret fluctuations of Galac- 
tic foregrounds or confusion noise in overdense regions as 
real sources. This implies that the subtraction of sources 
above the nominal detection limit leaves substantial residu- 
als, that, in the case of CMB experiments, add to the con- 
tamination of CMB maps. The situation improves substan- 
tially if we can take advantage of the prior knowledge of 
source positions. This means that the SASSy and a very 
large area Herschel survey is extremely beneficial for de- 
tection and photometry of Planck sources. 

For the purpose of cleaning Planck CMB maps from 
point source contamination, it should be noted that, as il- 
lustrated by the 850 /im panels of Fig. 1101 point sources are 
important on small angular scales (large £). Thus, although 
the SASSy survey will be an exceedingly important resource, 
a very large area Herschel/SPIRE survey will bring in key 
additional information. In fact, thanks to the steep rise of 
dust emission spectra with increasing frequency, a 250 /im 
survey to a flux limit of 100 mjy is deep enough to obser- 
vationally characterize both Poisson and clustering fluctua- 
tions due to all the main populations of dusty galaxies, pro- 
viding key input information for techniques, like the widely 
used Maximum Entropy Method, that need knowledge of 
the power spectra of the various components to allow an 
accurate subtraction of their contribution from CMB maps. 

The considered surveys will also provide the first com- 
plete, sub-mm selected samples of rare objects, such as 
blazars. Owing to their much flatter counts (compared to 
those of dusty galaxies), blazars are prominent in shallow 
surveys, while they are swamped by dusty galaxies at faint 
flux levels. We thus expect that the richest blazar sample 
(~ 1200 objects) will come from the SCUBA-2 SASSy sur- 
vey. The Planck 850 /mi survey should detect ~ 550 blazars 
at |6| > 30°, while the number of detections is expected 
to rapidly decrease in shorter wavelength channels, as a 
consequence of the rapidly increasing detection limit. Al- 
though Herschel/SPIRE surveys of ~ 400 deg 2 to a flux 
limit of ~ 100 mjy may detect only ~ 30 blazars, the multi- 
wavelength information would be of great value to inves- 
tigate the currently controversial issue of the "blazar se- 
quence" . 
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